Frequency Tracking: LMS and RLS Applied to Speech Formant Estimation
Abstract
1) Introduction Many speech processing algorithms assume that the signal is stationary over short intervals (approximately 20 to 30 ms). This assumption holds for many applications, but it is too restrictive in some contexts. This work investigates the application of adaptive signal processing to the problem of estimating the formant frequencies of speech. Two algorithms were implemented and tested: the conventional Least-Mean-Square (LMS) algorithm and the conventional Recursive Least-Squares (RLS) algorithm. The formant frequencies are the resonant frequencies of the vocal tract. Speech is the result of the convolution between the excitation and the vocal tract impulse response [Rabiner, 78], so a kind of "deconvolution" is required to recover the formants. This is not an easy problem, because the excitation signal is not available. There are several algorithms for formant estimation [Rabiner, 78], [Snell, 93], [Laprie, 94]. One popular scheme is to calculate an all-pole filter H(z) using the Linear Predictive Coding (LPC) technique, based on the Levinson-Durbin algorithm. The formants can then be associated with the poles of H(z). Alternatively, the formants can be estimated through an adaptive algorithm, as shown in [Hodgkiss, 81]. This work presents results for the conventional LMS and RLS algorithms.
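The idea of tracking a resonance with an adaptive linear predictor and reading the frequency from the poles of the resulting all-pole model can be illustrated with a minimal sketch. This is not the implementation used in the work above: the function names (`lms_predictor`, `rls_predictor`, `pole_frequencies`), the step size, the forgetting factor, and the initialization constant are all assumptions chosen for illustration, and a synthetic sinusoid stands in for a speech signal.

```python
import numpy as np

def lms_predictor(x, order=2, mu=0.05):
    """Conventional LMS linear predictor; returns the weight trajectory.

    The predictor models x[n] as w . [x[n-1], ..., x[n-order]].
    """
    w = np.zeros(order)
    history = np.zeros((len(x), order))
    for n in range(order, len(x)):
        u = x[n - order:n][::-1]          # past samples, most recent first
        e = x[n] - w @ u                  # a priori prediction error
        w = w + mu * e * u                # LMS weight update
        history[n] = w
    return history

def rls_predictor(x, order=2, lam=0.99, delta=100.0):
    """Conventional RLS linear predictor; returns the weight trajectory.

    lam is the forgetting factor; delta scales the initial inverse
    correlation matrix (both values are illustrative assumptions).
    """
    w = np.zeros(order)
    P = delta * np.eye(order)
    history = np.zeros((len(x), order))
    for n in range(order, len(x)):
        u = x[n - order:n][::-1]
        k = P @ u / (lam + u @ P @ u)     # gain vector
        e = x[n] - w @ u                  # a priori prediction error
        w = w + k * e
        P = (P - np.outer(k, u @ P)) / lam
        history[n] = w
    return history

def pole_frequencies(w, fs):
    """Frequencies (Hz) of the complex poles of H(z) = 1 / (1 - sum_k w_k z^-k)."""
    poles = np.roots(np.concatenate(([1.0], -w)))
    poles = poles[np.imag(poles) > 0]     # keep one pole of each conjugate pair
    return np.sort(np.angle(poles) * fs / (2 * np.pi))

# Synthetic test signal: a single 1000 Hz "resonance" sampled at 8 kHz.
fs = 8000
t = np.arange(4000) / fs
x = np.cos(2 * np.pi * 1000 * t)

f_lms = pole_frequencies(lms_predictor(x)[-1], fs)
f_rls = pole_frequencies(rls_predictor(x)[-1], fs)
```

With real speech, a higher predictor order would be needed (roughly two coefficients per expected formant), and the weight trajectory, rather than only its final row, would be converted to pole frequencies sample by sample to obtain formant tracks.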
Similar resources
Hierarchical approach to formant detection and tracking through instantaneous frequency estimation - Electronics Letters
Formant frequencies, represented by major peaks in the spectrum of speech signals, convey important information about speech. The authors propose a method for detecting the formants of voiced speech through ‘instantaneous frequency’ (IF) estimation using a recursive least square (RLS) algorithm. The accuracy of the technique is assessed by comparing it with conventional formant detection techni...
Wavelet ridge track interpretation in terms of formants
This paper proposes two new approaches for formant tracking using Fourier and wavelet ridges. The speech signal is decomposed into Time-Frequency representations issued from windowed Fourier transform and wavelet transform. Formant tracking is achieved by exploring ridges from time-frequency representation and imposing continuity constraints on formant trajectories. These approaches are validat...
Formant-tracking Linear Prediction Models for Speech Processing in Noisy Environments
This paper presents a formant-tracking method for estimation of the time-varying trajectories of a linear prediction (LP) model of speech in noise. The main focus of this work is on the modelling of the non-stationary temporal trajectories of the formants of speech for improved LP model estimation in noise. The proposed approach provides a systematic framework for modelling the inter-frame corr...
Formant-tracking linear prediction models for speech processing in noisy environments
This paper presents a formant-tracking method for estimation of the time-varying trajectories of a linear prediction (LP) model of speech in noise. The main focus of this work is on the modelling of the non-stationary temporal trajectories of the formants of speech for improved LP model estimation in noise. The proposed approach provides a systematic framework for modelling the inter-frame corr...
Speech formant frequency estimation: evaluating a nonstationary analysis method
The objective of this paper is to critically evaluate the performance of a nonstationary analysis method in tracking speech formant frequencies as they change with time due to the natural variations in the vocal-tract system during speech production. The method of instantaneous frequency estimation is applied to the tracking of speech formant frequencies to observe the time variations in the vo...